A Random Loop Model for Long Polymers 



Manfred BohrE and Dieter W. Heermann 
Institute of Theoretical Physics, University of Heidelberg, 
Philosophenweg 19, D-69120 Heidelberg, Germany 

Roel van Driel 

Swammerdam Institute for Life Sciences, University of Amsterdam, 
BioCentrum Amsterdam, Kruislaan 318, 1098 SM Amsterdam, The Netherlands 

(Dated: February 1, 2008) 

Remarkably little is known about the higher-order folding motifs of the chromatin fibre inside 
the cell nucleus. Folding depends among others on local gene density and transcriptional activity 
and plays an important role in gene regulation. Strikingly, at fibre lengths above 5 to 10 Mb the 
measured mean square distance (R 2 ) between any two points on the chromatin fibre is independent 
of polymer length. We propose a polymer model that can explain this levelling-off by means of 
random looping. We derive an analytical expression for the mean square displacement between two 
arbitrary beads. Here the average is taken over the thermal ensemble with a fixed but random 
loop configuration, while quenched averaging over the ensemble of different loop configurations - 
which turns out to be equivalent to averaging over an ensemble of random matrices - is performed 
numerically. A detailed investigation of this model shows that loops on all scales are necessary to 
fit experimental data. 
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I. INTRODUCTION 

The folding behaviour of DNA inside the cell nucleus 
has been subject to extensive studies. The diploid human 
genome consists of about 2 m of double stranded DNA, 
which is wrapped around histone protein octamers, form- 
ing nucleosomes every 180 to 250 bp. This chromatin 
fibre has a length of about 50 cm and is packaged in- 
side a cell nucleus of about 10 /im diameter. Despite 
its tight folding, the DNA is readily accessible to large 
numbers of proteins required for transcription, replica- 
tion and DNA repair. The first stages of compaction are 
well-known (see ref. [l[ for a review): The DNA double 
strand is wrapped around histone octamers in a 1-and- 
3/4 left-handed super helical turn forming the so-called 
nucleosomes, which are connected by stretches of linker 
DNA. This nucleosome-linker complex has a diameter of 
about 10 nm and is called 10 nm chromatin fibre. It has 
been shown in vitro that this fibre in turn condenses to 
a fibre of 30 nm diameter, whose structure is still under 
discussion @, H, 0]. 

Remarkably little is known about the higher-order fold- 
ing of the chromatin fibre inside the human interphase 
nucleus || . It is not possible to follow the chromatin fibre 
in the interphase nucleus by imaging techniques. There- 
fore indirect approaches are being used to study chro- 
matin folding. One method that has been applied by sev- 
eral groups is fluorescence in situ hybridization (FISH). 
By labelling chromatin in the intact cell with pairs of 
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FISH -markers one can measure the mean squared physi- 
cal distance (R 2 ) between these markers as a function of 
the genomic distance d. A recent study has shed light on 
the folding at larger length scales Q. It was shown that 
there is a levelling-off in the physical vs. genomic dis- 
tance plot at genomic distances longer than a few mega 
base pairs (Mb), so that approximately (R 2 ) ~ 0(1) for 
large d. At shorter length scales, i.e. up to a few Mb, 
the folding behaviour is best described within the globu- 
lar state model, where the mean square physical distance 
scales like 

(R 2 ) ~ d 2 / 3 . (1) 

The unexpected behaviour at larger genomic distances 
called for further explanation. 

In the past, there have been several attempts to ex- 
plain the folding motifs of chromatin in interphase cells 
using polymer models @, i, ©, [l(J. One approach has 
been to model the chromatin fibre as a random walk in 
confined geometry Q • Other data have been interpreted 
in terms of a Random- Walk/Giant-Loop (RWGL) model 
suggesting that there is a random walk backbone with 
regularly placed loops of about 3 Mb [1, Q to explain dif- 
ferent observed folding regimes on large and small scales. 
Another model, the Multi-Loop-Subcompartment (MLS) 
model [HI [H|, proposes the existence of a rosette-like 
structure with 120 kb loops. None of these models pre- 
sented so far is able to explain the levelling-off observed 
in the recent study. 

It has been shown that the formation of loops of differ- 
ent sizes plays an important role in gene regulation and 
gene expression. One system that has been extensively 
studied is the /3-globin locus [HI, Here chromatin 
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FIG. 1: The Random Loop Model averages over (a) the ther- 
mal disorder and (b) over the possible configurations of loops. 
Here one can see two possible configurations of loops. 



expression is controlled by the formation of loops bring- 
ing different regulatory elements of the locus in physical 
contact. In this system the loop sizes are in the order of 
10 kb. The formation of these loops is dynamic: different 
genes in the locus interact with the control locus during 
development in a mutually exclusive way, correlated with 
their expression. Loops that link promoter and enhancer 
complexes have been found of up to 3 Mb [3, 0j| . Loops 
of up to several tens Mb have been associated with the 
formation of transcription factories, which bring together 
transcriptionally active genes [l5|, [T6[. In all cases, chro- 
matin loop formation is a dynamic process (l7j . 



Based on these observations we propose a general poly- 
mer model which is able to explain the levelling-off be- 
haviour observed in experiment where the mean squared 
physical distance scales like (R 2 ) ~ 0(1) at genomic dis- 
tances above 1-2 Mb. The model takes into account the 
looping of the polymer, i.e. the chromatin fibre. The 
backbone of our polymer is formed by a random walk 
chain. For reasons of mathematical tractability we do 
not introduce excluded volume interactions. While Sachs 
et al. assumed loops of uniform size and fixed positions 
of the loop attachment points [g| , we allow loops to have 
random polymer length and attachment points. We are 
interested in the average conformational properties of the 
system, mainly the mean square distance between two 
beads of the chain. We derive an analytical result for 
the average over the thermal disorder of one specific con- 
figuration of loops. The randomness of the loops gives 
rise to average over all different loop configurations. Nu- 
merical methods are used to get the quenched average 
over the different configurations of loops (see fig. [1] for 
two possible configurations). This average corresponds 
to the cell-to-cell variation of the measurements. We 
compare our model to the experimental data and investi- 
gate which loops are necessary to get a levelling-off where 
(R 2 )~0(1). 



II. THEORY 
A. Polymer models 

In the past, experimental data from FISH measure- 
ments has been compared quite successfully to polymer 
models. Mainly three basic polymer models are used: 
(a) the random walk (RW) model where it is assumed 
that the orientations between chain segments are com- 
pletely uncorrelated and no volume interactions are taken 
into account, (b) the self-avoiding walk model (SAW) 
takes excluded volume into account, while (c) the globu- 
lar state (GS) model furthermore includes temperature- 
dependent attractive interactions (cf . ref . [HI, [l9[ ) . One 
characteristic feature of a polymer model is the mean 
squared end-to-end distance (R%)- For the basic models 
above, the end-to-end distance scales like 

(R%) = b 2 N 2 » (2) 

in the limit of large N, where I is the linker length, N the 
chain length and v a constant depending on the model 
used: v = 0.5 for the RW, v w 0.588 for the SAW and 
v = 1/3 for the GS. Assuming that eq. © also holds 
for intra-chain distances in the limit of large N, one can 
compare the experimental data to these models @. Re- 
lation ^ is no longer valid as soon as we introduce long- 
range interactions along the chain, such as chromatin 
looping. 

Polymer modelling makes use of a coarse-graining ap- 
proach, i.e. we divide the chromatin fibre into N equal 
subunits of length b. The subunits itself are assumed to 
be uncorrelated, i.e. they can freely rotate around each 
other. Such a description of a biological polymer is cor- 
rect when we make N sufficient small so that b is larger 
than the persistence length of chromatin, i.e., the bend- 
ing energy vanishes on this length scale. 

B. General expression for the average over the 
thermal ensemble 

To set up our Random Loop Model we first derive 
a general expression for the mean square distance be- 
tween two arbitrary beads of the chain where harmonic 
interactions between all beads of the chain are allowed. 
The spatial positions of the chain's beads are denoted by 
xo, . . . ,xjv and Xj = {xi,yi, z.;). For reasons of mathe- 
matical tractability we consider a phantom chain (ran- 
dom walk) where no excluded volume is taken into ac- 
count. The Gaussian chain that forms the backbone of 
our model is characterized by the following potential, 

k N 

t^Gaussian = ^ || Xj — Xj_i || 

i=i 

In addition to the random walk backbone, we allow each 
bead to interact with any other via harmonic potential, 
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resulting in an interaction term in the potential 



U = U Ga 



N 

E < 

i<J 
i-j'l>l 



where the spring constants for the loop 

attachment points. The spring constants k^- are given 
here and in the following in reduced units (comprising 
the term 1/kgT). Right now we keep them arbitrary 
but they will be randomly chosen later within the model. 
This potential has already been proposed by Sachs et 
al. @ but not been elaborated further. It can be rewrit- 
ten in the form 



Kij . In the following we only assume that K is & symmet- 
ric and regular matrix. So P\ in eq. ([5]) turns out to be 
a multivariate normal distribution with mean \x = and 
covariance matrix X = K^ 1 . The marginal distribution 
for two arbitrary beads I and J, 



P(xi,xj) 



N 



n p ^ 



,x N ) (7) 



i=l 
i^I,J 



can be evaluated by standard methods for normal distri- 
butions. Going back to three dimensions we obtain after 
some basic integral evaluations the joint probability den- 
sity for the distance between two beads I and J, 



1 N 

i<j 



-x,- = - 



N 

£< 

i,3=0 



(3) 



where = k for \i — j\ = 1. 

The probability density for a bead conformation 
(xo, . . . , xjv) in the canonical ensemble is given by the 
Boltzmann factor 



P(x , . . .xjv) = C exp 



U 



(4) 



where C is a normalization constant and U = 
t/(xo, . . . x/v) is the total potential energy of the chain. 

We now eliminate the degrees of freedom stemming 
from the translational invariance of the problem by set- 
ting xo = (the absolute position of the chain in space 
is irrelevant for distances between beads). 

Due to the Gaussian character of the probability den- 
sity G, the spatial dimensions factorize, 



P(xi, . . .,xjv) = 

Pi(xi, . . . ,x N ) ■ Pi(yi, 



,y N ) ■ Pi{zx, ...,z N ) 



and we can concentrate on the one-dimensional density 
function Pi. By an easy calculation omitted here, we 
can rewrite the one-dimensional probability density Pi 
as follows 



P(|| xj-xj ||) = P(ru) 



Cr]j exp 



1 



1 



2 ctjj + an - 2au 



u 



Here 



k- 1 = K) 



and C is the normalization constant. Using 



1 



1 



2 a j j + an — 2a ij 
and calculating the correct normalization we obtain 
4 



P(ru) 



I^ r 2 jj exp [-Tfjj] (8) 



and finally 



r 2 ) 
*-J I thermal 



J r*jP{rij)dru 



— = 3(ajj + a H - 2a TJ ) . (9) 



The bracket delimiters here denote the average over 
the thermal ensemble of N + 1 beads interacting via a 
given, but fixed harmonic potential. 



PiOci, . . . ,x N ) = d exp(--X T i^X) 
where X = [xx, . . . ,xjy) T and 



(5) 



K = 



2_^j=Q K lj — Kl2 

EN 

j¥2 



V 



-KJV1 



-KN2 



-Ki N 

-K2N 



(6) 



3^N 



Up to now we have not made any assumptions concern- 
ing the spring constants (i.e. basically the matrix entries) 



C. The Random Loop Model 

The last section was dedicated to the derivation of a 
general formula for the mean square displacement be- 
tween two arbitrary beads of the chain where each bead 
may interact with any other via harmonic potential. This 
quantity turned out only to depend on the matrix K, or 
more accurately speaking, on its inverse. The matrix K 
contains all information about the interactions. Now we 
want to specify this matrix. Our model assumes the chro- 
matin fibre to have a random walk backbone, meaning 
that with \i— j\ = 1. Furthermore the chromatin 

forms loops whose size and positions are randomly dis- 
tributed along the chain. On a more general footing we 
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can restrict the possible loop sizes £ to a certain range 
Within this range all loops are chosen randomly 
by setting: 

{k with probability V . . 

, if h < k- 1 < /■ 
with probability l-V 

Kij = otherwise 

Note that we can set k = 1 as it only scales the mean 
square displacement in eq. ((9]). Thus our model has two 
adjustable parameters, namely the chain length N and 
the probability V . 

The resulting matrices K represent an ensemble of di- 
agonally dominated band random matrices and each ma- 
trix of this ensemble represents a loop configuration. This 
ensemble of random matrices has been investigated re- 
cently [20j • We are interested in the ensemble average of 
the mean square displacement, i.e. in the quantity 

thermal ) loops 

= 3 (V./j) loops + <CT//) loops - 2 (er/j) loops ) 

This average is a quenched average and is equivalent to 
averaging over the ensemble of random matrices given by 
the above constraints. In sec. IIII Bl we also consider the 
case of the annealed ensemble and give an explanation 
why we use the quenched one here. 

As our model already assumes that the chromatin fibre 
is translational invariant (as we do not take into account 
genomic content), we are only interested in the mean 
square displacement (R^) for two beads separated by 
n = \i- j\. 

The average over the ensemble of random matrices can- 
not be performed analytically, so we have to use a rep- 
resentative subset of the ensemble and numerically cal- 
culate the inverse matrix and thereby the mean square 
displacement. 

III. RESULTS 

A. Comparison to experimental data 

As noted earlier, our polymer model makes use of 
coarse-graining, since it is impossible to model such a 
long fibre in detail. Restrictions are given by comput- 
ing time, which basically depends on the size of the ma- 
trix K . For our calculations we chose a matrix size of 
N = 1000 as a good compromise between computing time 
and not too coarse graining. Using a coarse-graining ap- 
proach implies that we neglect details on a scale below 
the effective segment length being 150 kb in the follow- 
ing figures. Therefore we cannot resolve those loops that 
have been investigated in some gene-expression systems 
like the /3-globin locus. As we are interested in large scale 
chromatin organization, it is justified to neglect these 
loops as they have no effect for the levelling-off at large 



genomic distances, but only lead to a rescaling of the 
effective Kuhn length. 

The first point of interest is which loops are neces- 
sary for the observed experimental behaviour. Do small 
loops (in the order of 100 kb to 1 Mb) already lead to 
the levelling-off, or are loops on all scales up to 80 Mb 
needed? Restricting the loop sizes to a range £ £ [1, a] 
only leads to a rescaling of the effective segment length, 
we still have (R%) ~ n (fig. Hk). In fig- 12b) we ana- 
lyzed the ensemble where only large loops are allowed. 
Obviously, large loops are responsible for forcing the col- 
lapse of the chain, but the overall behaviour of the mean 
square physical distance does not fit the experimental 
data. Therefore loops on all scales are needed to obtain 
the levelling-off observed in experiment. 

The characteristic features of the mean square dis- 
placement allowing loops on all scales can be seen in 
fig-Hfc)- At short contour lengths the mean square dis- 
placement grows similar to a random walk, but soon a 
levelling-off can be observed due to the attractive long- 
range interactions which is fairly ~ 0(1). While the con- 
tour length approaches N , the mean square displacement 
again rises to a random walk like behaviour. This is a 
chain end effect which is not of interest to us, as experi- 
ments only measure intra-chain distances. It is due to the 
construction of the loops, as the probability for having a 
loop with a larger size becomes increasingly small. 

Thus, adding long-range interactions forcing the poly- 
mer to form loops yields completely different traits than 
a simple random walk or self-avoiding walk model. Note 
that the probabilities V are chosen very small, meaning 
that a few loops suffice to obtain this levelling-off. The 
number of independent randomly choosen entries is 
C = (N - 1) (N - 2)/2 for a N x TV-matrix and therefore 
the average number of loops per configuration is given 
by C ■ V. With P = 4x 10~ 5 and N = 1000 one has an 
average of about 20 loops. 

In fig. [3] the model is compared to the experimental 
data for different values of V . Here one has to introduce 
two new scaling parameters, the segment length in phys- 
ical units (e.g. nm) and the segment length in base pairs. 
The data is shown for a segment length of 300 nm and 
150 kb. The latter is the size of the flourescent mark- 
ers used in experiments, therefore it does not make any 
sense to model on a more detailed scale. As mentioned 
above, using this coarse-graining approach all details on 
length scales smaller than 150 kb are neglected. The 
model can quite well explain the levelling-off at genomic 
distances above a few mega-base pairs as well as the rise 
at small genomic distances. As we have shown that on 
small genomic distances we have a globular-state-like be- 
haviour Q , this random- walk-based model does not yield 
perfect results here. 

In a recent publication Q we already mentioned that 
plotting (Rn) versus n is not a very sensitive method 
to check for the correctness of a model. Looking at the 
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FIG. 2: Mean square displacement between two chain seg- 
ments in relation to their contour length for the Random 
Loop Model for different allowed loop sizes. The chain length 
is always N = 1000 In figure a) only loops smaller than a 
certain size s are allowed. The basic scaling behaviour is still 
(Rn) ~ n with a changed effective contour length compared 
to the free random walk. For this plot V was chosen that 
the average number of loops per configuration is 100. Figure 
b) is for large loops where only loops of sizes I in a range 
[N — s, N] are allowed. While large loops seem to be respon- 
sible for the collapse of the chain, they alone cannot explain 
the experimental data. As in a) V was chosen that the mean 
number of loops per configuration is 100. Figure c) shows the 
results for the situation where loops of all sizes are allowed. 
The levelling-off to ~ O(l) can already be achieved by 

a small number of loops 
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FIG. 3: Experimental data compared to the Random Loop 
Model. The data is taken from and includes short and 
long distance sets. The results of the Random Loop Model 
are shown for N = 1000 and different values of V. 



cumulant relation between higher-order moments, 



c 4 



(R 2 



(10) 



gives much stronger evidence, as this expression is re- 
lated to the distribution of the distances and not only 
its average value. Furthermore it has the advantage that 
the physical length scale divides out. Eq. (JTUJ) can be 
easily evaluated for a Gaussian Chain, where t^ w = 5/3. 
To obtain the value of the cumulant relation for a self- 
avoiding walk one has to use the expression for the 
distance probability density obtained by scaling argu- 
ments ilii, 



Psaw(Rn) 



AR% +2 exp 



-DR 



N 



/i = 0.28 



(11) 

Numerical integration gives c| AW w 1.506, so both RW 
and SAW yield a constant expression. Fig. 2] shows that 
this constant has a value significantly below the fluctu- 
ations of the data. Here our model is in better agree- 
ment with experiments. We should point out here the 
importance of averaging over the disorder of loops. The 
cumulant expression C4 only averaged over the thermal 
ensemble given by equations ([5]) and @ is the same as 
for a pure random walk, namely 5/3. It is the average 
over the loop configurations that changes this behaviour, 
bringing it in better agreement with the data. 



B. Annealed vs. quenched average 

In systems with disorder one has to perform averages 
both over a set of statistical variables and over a set of 
random variables representing the disorder [23|. In the 
case of the Random Loop Model the partition sum Z 
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FIG. 4: The cumulant expression a = (R 4 )/(R 2 ) of the 
experimental data compared to SAW, RW (straight lines) and 
Random Loop Model (dashed lines). For the legend, see fig. [3] 
The data shows significant differences to a RW and a SAW 
(horizontal lines), and is fluctuating around the cumulant of 
the Random Loop Model. 



depends on the statistical variables xi, . . . , xjv and a set 
of random variables representing the disorder {/%} with 
\i — j\ > 1. Above we have performed the average first 
over the statistical variables, while the average over the 
disorder has been taken over the quantity (rij) 2 . This 
corresponds to a process of quenched averaging. Using 
this method we assume that the time the cell needs to 
go into a new disorder configuration is much longer than 
the time needed for the cell to come into thermal equi- 
librium, T cq <C Tdis- At least to our knowledge, such time 
scales are not known inside the cell nucleus. Time evolu- 
tion measurements cannot be performed on a cell using 
FISH markers as the cell has to be fixated before ap- 
plying imaging techniques. Different configurations can 
only be observed by looking at different cells. It seems 
to us more reasonable to use a quenched type of average 
for comparison to biological data, as the loops are func- 
tional complexes which have to persist a while in order 
to properly fulfill their tasks. 

Nevertheless, it is interesting to consider the other 
case, the annealed ensemble. This type of average should 
be applied if r^s <C r cq . The annealed average of the par- 
tition sum can be written as 



(Z) 



E 

} 



2({Xfc},{/Sij}) p({Kij}) 



(12) 



where the sum is over all possible configurations of dis- 
order, and p is the probability of one such configuration. 
We restrict our calculation to the case that loops of all 
sizes are allowed and that the spring constant is k for 
adjacent beads as well as for loops. Assuming that the 
i^ij are 1.1. d. Bernoulli as before, the average over the 



disorder can be carried out exactly, 
dxi . . . dx N exp(— C/q a 



jaussian J 

I ] \V (e-5«ll x i-*iii - 1 ) + 1 
i<j-l 

Introducing the effective potential 

, N-l 



i=0 |i-j|>l 



where =|j Xj — Xj || 2 , we can rewrite the partition sum 



(ixi . . . (ixjv exp(— U c g) 



(13) 



The effective potential has two parts: Adjacent beads 
with \i — j\ — 1 keep their attractive harmonic potential, 
while all non- adjacent beads interact via a pairwise at- 
tractive potential V(r). This potential is characterized 
by a minimum at r = 0, while for large r it reaches a 
plateau at V(r — ► oo) = — log(l — V). In a low tempera- 
ture approximation, a series expansion around r = up 
to second order gives 



V(r) = ^Vnr 2 



(14) 



- a harmonic potential with effective spring constant Vk. 

The partition sum in (|13p cannot be evaluated analyt- 
ically and therefore we do not obtain an expression for 
the mean square displacement in the annealed case. One 
could obtain results using extensive and time-consuming 
MD or MC simulations. It will be left for future investi- 
gations. 



C. Limiting cases without disorder 

In most cases one cannot solve the model presented 
above analytically. Using the quenched ensemble one 
cannot calculate the average over the disorder, while us- 
ing the annealed ensemble one cannot obtain the par- 
tition sum after having performed the disorder average. 
Therefore we calculated sample averages for the quenched 
case above. There are two special cases where the model 
can be solved exactly. These are the limiting cases where 
no disorder is present. V = is the situation of a normal 
Gaussian Chain with spring constant k. It is well known 
that the mean square distance between two beads sepa- 
rated by n monomers is given by (R^) = —n. The other 
limit, V = 1, corresponds to a fully connected network 
of beads. Assuming that all beads interact with spring 
constant n, we can solve this problem analytically. Here 
we basically do not deal with a linear chain any more. 
The interaction matrix K = (fcy )i.j in this case writes 



for i = j 
for i 7^ j 



(15) 
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By an easy calculation one can show that the inverse 
matrix is given by 



= J (N + 1)K 



for 
for 



i = J 
i 7^ 3 



(16) 



Recall our definition of the chain at the beginning of 
sec. Ill Bl Although we have an N x iV-matrix our chain 
has N+l beads, as we set x = 0. Inserting into eq. © 
yields 



{N + 1)k/2 



(17) 



Within this system two beads are interacting with an 
effective harmonic potential with K e g = (N + l)re/2. 

Of major interest is the case where V = 1, but where 
adjacent beads interact with a different spring constant 
than loops, i.e. = k for \i — j\ = 1 and Kij — k for 
\i — j | > 1 . We were not able to solve this case analyti- 
cally. One might take this system as a model for the low- 
temperature limit of the annealed case in eq. (|14p where 
k is replaced by an effective interaction k — Vk. On a 
more general footing this case might also be regarded as 
a model for a system where the random attraction with 
probability V and loop spring constant k has been re- 
placed by an average attraction with probability V = 1 
and loop spring constant Vk. It is clear a priori that 
such a potential will lead to a collapse of the chain, as all 
beads are interconnected. In fig. [5] we chose k = k = 1 
and V = 4 x 1CP 5 as the reference curve. In comparison 
with the case of average attraction (V = 1, ac = 1, k = V) 
the levelling-off is much less pronounced. Of course it is 
possible to come into close agreement with the reference 
curve by choosing another interaction constant. For our 
reference curve one would have to lower k by about one 
order of magnitude, corresponding to V ~ 2 x 10~ 6 ( < 1 
loop per chain!). Although one could fit the data with 
these averaged attraction potential, we see no biological 
reason for such a potential to exist in the cell. 



IV. CONCLUSIONS 

In this paper we present a polymer model for the fold- 
ing of the chromatin fibre in the interphase nucleus, based 
on recent experimental results @. These show that the 
chromatin fibre inside the human cell nucleus is ruled 
by two different folding regimes: At small genomic dis- 
tances, i.e. below ~ 3 Mb the data can be explained 
well with a globular state polymer. At larger genomic 
distances there is a levelling-off to a scaling behaviour 
(R 2 ) ~ 0(1). This indicates the presence of long-range 
interactions. In agreement with the experimental find- 
ings we propose the existence of loops to be responsible 
for this levelling-off. The main aspect of our model is the 
assumption of loops of various sizes and positions, where 
we average not only over the thermal ensemble but also 




100 200 300 400 500 600 700 800 900 1000 
contour length n 

FIG. 5: The Random Loop Model (RLM) compared to a 
system where the random attraction (setting Kij = 1 with 
probability V) has been replaced by an average attraction 
(setting all Kij = Vk for \i — j\ > 1). Shown are the RLM 
reference curve for V = 5 x 1(F 5 (grey line), the corresponding 
system with average attraction (black line) and two systems 
with smaller average attraction. 



over different configurations of loops. Keeping the loop- 
ing probability V constant in the ensemble. Eq. © gives 
a general expression for the mean square physical dis- 
tance within the thermal ensemble between two arbitrary 
chosen beads, when each bead is allowed to interact with 
any other via harmonic potential. We show that (R 2 ) 
only depends on the interaction matrix K, which we have 
introduced in ([6]). Within the Random Loop Model the 
matrix entries (equivalent to random loops) are chosen 
in a random manner to be — k with probability V or 
otherwise. The average over the ensemble of loop config- 
urations then turns out to be equivalent to averaging over 
the specific ensemble of random matrices with fixed V . 
We show that random loops on all length scales explain 
the levelling-off observed in experiment, while restricting 
the loop sizes to only large loops (in the order of 50 - 
100 Mb) or small loops (100 kb - 10 Mb) does not fit the 
data. 

In contrast to the Random- Walk/ Giant-Loop model 
we do not assume fixed-size and regular placed loops. 
The average over the ensemble of different loop configu- 
rations turns out to be necessary to obtain the levelling- 
off observed in experiment. Its importance also becomes 
apparent in fig. HI where the RWGL model would yield 
the same result as the random walk, namely a constant: 

C RWGL = 5/3 

In a recent paper [6j we have shown that on short ge- 
nomic distances the globular state fits the experimental 
data best compared to a self-avoiding and random walk. 
In the present study we have neglected the effect of ex- 
cluded volume. Therefore on short genomic distances 
our model will explain the data inadequately as it shows 
random-walk-like behaviour by design on short contour 
lengths, because for a globular state polymer the expo- 
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nent in eq. ((2|) is v = 1/3, while for a random walk poly- 
mer it is v = 1/2. On larger genomic distances the exis- 
tence of a small number of loops can explain the levelling- 
off without the need for introducing excluded volume in- 
teractions. The role of excluded volume inside the cell 
nucleus on the spatial chromatin properties remains an 
open question, because of the impact of an enzyme called 
topoisomerase II, which is capable of cutting the DNA 
double strand in order to let another strand pass through 
it. This event might well give rise to a random-walk-likc 
statistics, dependent on the frequency of these events. 
The model so far is able to explain some basic features 



of DNA folding revealed in recent experiments @ . Future 
work on this model might encompass for example the 
integration of excluded volume interactions and its effect 
on the mean square displacement. 
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